This file includes Supplementary Figures 1 to 16 



Supplementary Figure 1. Correlation between Fst and ADAF in a set of 5000 random sites from 
Supplementary Figure 2. ADAF power to detect selection in simulated data. (See manuscript 
Supplementary Figure 3. Number of HighD sites according to different ADAF thresholds and 
window sizes. 
Supplementary Figure 4. Manhattan plots of rank p-values for cvDAFs at continental and 
population levels. Chromosomes are represented by alternating turquoise and grey colors; red, 
blue and black dots represent INDEL, SNP and SV sites, respectively, that have been identified 
as lowly differentiated (LowD sites). 
Supplementary Figure 5. Number of LowD sites according to different ADAF thresholds and 
window sizes. 
Supplementary Figure 6. Validation results in independent HapMap samples. 
Supplementary Figure 7. Expected and observed number of HighD sites under different 
conditions of window sizes and ADAF thresholds. 
Supplementary Figure 8. Sensitivity to migration rate (m_rate) of the expected number of 
HighD sites from simulations under neutrality, and comparison of two different demographic 
models. (a) Simulations of the AFR-EUR and ASN-EUR comparisons under the model proposed 
by Gravel and colleagues (reference number XX in the main text), as a complement to what is 
presented in the main text. The dashed line represents the number of observed HighD sites. 
(b) Comparison of allele frequency spectra of 1000 Genomes data (red bars; sims_g=reference XX, 
sims_s=reference XX). (c) Number of expected HighD sites for simulated data under the 
model quoted above and another model (references X in the main text). 
Supplementary Figure 9. Features of genomic sites matched for allele frequency and distance 
from gene. 
Supplementary Figure 10. Combined p-values for Tajima's D, Fay and Wu's H and Nielsen's CLR 
values in HighD sites and matched controls. Each column refers to the subset of HighD sites that 
have the highest DAF in the population indicated at the head of the column; the populations 
indicated in the rows are those in which the combined p-values have been calculated for the set 
of sites in the column. 
Supplementary Figure 11. Simulated data showing the frequency of the most common 
haplotype in the 2 kb surrounding a site under positive selection for a range of selective pressures 
leading to different final allele frequencies at the selected site (on the x-axis). For hard sweeps 
(final allele frequency=1) there is mostly a single haplotype. 
Supplementary Figure 12. Boxplots of the Levenshtein distance from the major haplotype of all 
other haplotypes in the 2 kb surrounding sites under positive selection in simulated data. The 
table below the boxplots reports the corresponding average values. FaF=final allele frequency 
of the site under selection. 
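As an illustration of the quantities named in Supplementary Figures 11 and 12, the following is a minimal sketch of how the frequency of the most common haplotype and the Levenshtein distances from it could be computed. It assumes that haplotypes are encoded as plain strings of alleles and that every non-major haplotype copy contributes one distance; the function names and the toy haplotypes are hypothetical and are not taken from the manuscript.

```python
from collections import Counter


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def distances_from_major(haplotypes):
    """Return the major haplotype, its frequency, and the distances of all others from it."""
    counts = Counter(haplotypes)
    major, major_count = counts.most_common(1)[0]
    freq = major_count / len(haplotypes)
    dists = [levenshtein(major, h) for h in haplotypes if h != major]
    return major, freq, dists


# Hypothetical toy haplotypes (assumption: one string per sampled chromosome).
haps = ["AACGT", "AACGT", "AACGT", "AACTT", "GACGA"]
major, freq, dists = distances_from_major(haps)
print(major, freq, dists)
```

In this sketch a hard sweep would show up as a major-haplotype frequency close to 1 and few, small distances among the remaining haplotypes.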
Supplementary Figure 13. Frequencies of haplotypes in 2 kb surrounding HighD sites 
demonstrated to be functional in examples of positive selection. 
Supplementary Figure 14. Boxplots showing the distribution of weighted Levenshtein distance 
in HighD sites and in matched controls for continental comparisons. 




Supplementary Figure 15. Fraction of HighD and matched genomic sites with haplotypic 
features similar to sites accepted as examples of classic selective sweeps. 
Supplementary Figure 16. Functional annotations in the genomic region surrounding the HighD 
site in CALD1 and median joining network of haplotypes surrounding the site. Haplotypes are 
derived from sites in linkage disequilibrium (D'=1) with the HighD site in the JPT population. 